Identification of CpG islands in DNA sequences using statistically optimal null filters

نویسندگان

  • Rajasekhar Kakumani
  • M. Omair Ahmad
  • Vijay Kumar Devabhaktuni
چکیده

: CpG dinucleotide clusters also referred to as CpG islands (CGIs) are usually located in the promoter regions of genes in a deoxyribonucleic acid (DNA) sequence. CGIs play a crucial role in gene expression and cell differentiation, as such, they are normally used as gene markers. The earlier CGI identification methods used the rich CpG dinucleotide content in CGIs, as a characteristic measure to identify the locations of CGIs. The fact, that the probability of nucleotide G following nucleotide C in a CGI is greater as compared to a non-CGI, is employed by some of the recent methods. These methods use the difference in transition probabilities between subsequent nucleotides to distinguish between a CGI from a non-CGI. These transition probabilities vary with the data being analyzed and several of them have been reported in the literature sometimes leading to contradictory results. In this article, we propose a new and efficient scheme for identification of CGIs using statistically optimal null filters. We formulate a new CGI identification characteristic to reliably and efficiently identify CGIs in a given DNA sequence which is devoid of any ambiguities. Our proposed scheme combines maximum signal-to-noise ratio and least squares optimization criteria to estimate the CGI identification characteristic in the DNA sequence. The proposed scheme is tested on a number of DNA sequences taken from human chromosomes 21 and 22, and proved to be highly reliable as well as efficient in identifying the CGIs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting CpG Islands and DNA Methlation in the Cow Genome Using DNA Microarray Meta-Analysis and Genome Wide Scanning

DNA methylation is a type of epigenetic changes that directly affects DNA. In mammals, DNA methylation is essential for fetal development and stem cell differentiation and this phenomenon essentially occurs within the CpG islands. In this study, two methods were used to study the DNA methylation profile of cow genome. In the first method, the DNA methylation profile of the differentially expres...

متن کامل

Identification of Cpg Islands Using a Bank of Iir Lowpass Filters

It has been known that biological sequences such as the DNA sequence display different kinds of patterns depending on their biological functions. This statistical difference can be exploited for identifying the region of interest, such as the protein coding regions or CpG islands, in a new biological sequence that has not been annotated yet. A region of particular interest is the epG island, wh...

متن کامل

Phylogeny of gazelles in some islands of Iran based on mtDNA sequences: Species identification and implications for conservation

Different species of gazelles are among the most endangered mammals on the Asian steppes and occur in the central, southern and northwestern regions of Iran. The previous conservation efforts in this region have been incomplete due to confusion about the phylogenetic relationship among various populations. So that, different conservation programs such as ex-situ breeding and transfer of captive...

متن کامل

PRIMEGENS-v2: genome-wide primer design for analyzing DNA methylation patterns of CpG islands

MOTIVATION DNA methylation plays important roles in biological processes and human diseases, especially cancers. High-throughput bisulfite genomic sequencing based on new generation of sequencers, such as the 454-sequencing system provides an efficient method for analyzing DNA methylation patterns. The successful implementation of this approach depends on the use of primer design software capab...

متن کامل

Study of promoter CpG island hypermethylation of cyclindependent kinase inhibitor gene p21waf1/cip1 on some breast carcinoma cell lines

The p21 belongs to the CIP/KIP family of CDK inhibitors involved in cell cycle arrest at specific stages of the cell cycle progression. DNA methylation is the best studied epigenetic mark that have been evidently associated to chromatin condensation, and repression of gene transcription. The CpG island hypermethylation in promoter region of certain genes occurs in cancer cells and affects tumor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 2012  شماره 

صفحات  -

تاریخ انتشار 2012